@hehe7318 hehe7318 commented Mar 5, 2025

Description of changes

  • Fix an issue where containerized jobs run through Pyxis/Enroot in a multi-user environment (integrated with Active Directory) failed with permission errors like the following:
slurmstepd-hades: error: pyxis:     mkdir: cannot create directory ‘/raid’: Permission denied
slurmstepd-hades: error: pyxis:     mkdir: cannot create directory ‘/tmp/enroot-data’: Permission denied
slurmstepd-hades: error: pyxis:     mkdir: cannot create directory ‘/run/enroot’: Permission denied
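
For anyone reproducing this, here is a minimal diagnostic sketch (not part of this change) that checks whether the current user could create each of the directories named in the errors above. It walks up to the nearest existing ancestor and tests it for write and search permission. The helper names `first_existing_ancestor` and `can_create` are hypothetical, and `os.access` only approximates what `mkdir` would see (it ignores ACLs on some filesystems and always succeeds for root).

```python
import os


def first_existing_ancestor(path):
    """Return the deepest path component of `path` that already exists."""
    current = os.path.abspath(path)
    while not os.path.exists(current):
        parent = os.path.dirname(current)
        if parent == current:  # reached the filesystem root
            break
        current = parent
    return current


def can_create(path):
    """Best-effort check: could the current user create `path` (or write into it)?

    If `path` is missing, the nearest existing ancestor must be a directory
    that is writable and searchable. If `path` already exists, the same check
    is applied to `path` itself.
    """
    anchor = first_existing_ancestor(path)
    return os.path.isdir(anchor) and os.access(anchor, os.W_OK | os.X_OK)


if __name__ == "__main__":
    # Paths taken from the slurmstepd error messages above.
    for target in ("/raid", "/tmp/enroot-data", "/run/enroot"):
        anchor = first_existing_ancestor(target)
        status = "ok" if can_create(target) else "permission denied (likely)"
        print(f"{target}: nearest existing ancestor {anchor} -> {status}")
```

Running it as each affected AD user on a compute node shows which parent directory blocks the `mkdir`.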

Tests

  • Tested manually on a ParallelCluster cluster running Ubuntu; all of the following succeeded:
    1. User A submits a Pyxis job
    2. User B submits a Pyxis job
    3. User C submits a Pyxis job
    4. User ubuntu submits a Pyxis job
    5. User A submits a Pyxis job again
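
The sequence above could be scripted roughly as follows. This is only a sketch of the manual steps, not the test that was actually run. The user names `userA`, `userB`, `userC`, the image `ubuntu:22.04`, and the use of `sudo -u` to switch users are all assumptions. `--container-image` is the Pyxis option for `srun`. The script only builds and prints the commands; it does not execute them.

```python
import shlex

# Hypothetical values: replace with real AD user names and a reachable image.
CONTAINER_IMAGE = "ubuntu:22.04"
SUBMIT_ORDER = ["userA", "userB", "userC", "ubuntu", "userA"]


def build_submit_command(user, image=CONTAINER_IMAGE):
    """Build the argv for one containerized job submitted as `user`."""
    return ["sudo", "-u", user, "srun", f"--container-image={image}", "true"]


def build_test_plan(users=SUBMIT_ORDER):
    """Return one command per step, in the same order as the manual test."""
    return [build_submit_command(user) for user in users]


if __name__ == "__main__":
    for step, command in enumerate(build_test_plan(), start=1):
        print(f"{step}. {shlex.join(command)}")
```

The last step repeats the first user on purpose: it checks that a user's job still works after other users have run containers on the same node.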

Checklist

  • Make sure you are pointing to the right branch.
  • If you're creating a patch for a branch other than develop add the branch name as prefix in the PR title (e.g. [release-3.6]).
  • Check all commits' messages are clear, describing what and why vs how.
  • Make sure to have added unit tests or integration tests to cover the new/modified code.
  • Check if documentation is impacted by this change.

Please review the guidelines for contributing and Pull Request Instructions.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

hehe7318 added 2 commits March 5, 2025 16:49
… to prevent permission errors when different AD users submit Pyxis jobs.
@hehe7318 hehe7318 requested review from a team as code owners March 5, 2025 22:09
@hehe7318 hehe7318 merged commit 65c1b4e into aws:develop Mar 6, 2025
29 of 31 checks passed